Temporal fractal structures: Origin of power-laws in the world-wide Web

Using numerical simulations and scaling theory we study the dynamics of the world-wide Web
from the growth rules recently proposed in Ref. [1] with appropriate parameters. We demonstrate
that the emergence of power-law behavior of the out- and in-degree distributions in the Web involves
the occurrence of temporal fractal structures, which are manifested in the scale-free growth of the local
connectivity and in first-return time statistics. We also show how scale-free behavior occurs in
the statistics of random walks on the Web, where the walkers use information on the local graph
connectivity.

I. INTRODUCTION



Complex evolving networks differ from static random graphs in that their size increases in time, which affects
the linking process in a nontrivial manner. Hence the emergent structure of links is related to salient features of the
growth processes in the network, which are governed by the individual linking rules [?]. For the scale-free
structures that emerge at large evolution times in networks with preferential linking, universality classes, each characterized
by the same set of scaling exponents, can be distinguished on the basis of several relevant details of the microscopic
linking properties. A theoretical background for the universality classes of dynamic networks is still missing. The
study of many particular networks has shown that certain dynamic constraints on the linking processes
can change the emergent scale-free behavior [1]-[?].

The world-wide Web differs from generic scale-free evolving networks in two ways: (1) it is represented by a
directed graph and, (2) most importantly, it has a variable wiring diagram. Frequent updates of the outgoing links,
which are characteristic of the behavior of agents in the real Web, make the wiring diagram of the Web graph change at
the same pace as the graph grows, whereas the wiring diagram of some other networks changes on a much slower scale
or not at all [?]. The intimate relationship between structural and growth properties leads to a specific architecture
of links in the Web. Recent measurements in the real Web have shown [?] that both the out- and in-degree distributions
are power-laws with different exponents, as is the size distribution of the connected clusters outside the giant component.
The occurrence of power-laws is a remarkable feature of a large number of complex evolving networks, and it indicates
the presence of underlying self-organization while the network grows. The emergent hierarchical organization of node
ranks is highly relevant for the stability of the network under attacks and for the character of other dynamic
processes on the network, such as random walk processes. Therefore understanding the mechanisms of
self-organization that lead to power-laws in the world-wide Web is crucial both for its functional stability and for designing
efficient search algorithms [?] and transport processes [?] on the Web graph.

As a step towards realistic modeling of the dynamics of the world-wide Web we recently proposed the model [1], which
takes into account the basic relevant features of the Web growth: directed linking, rewiring of preexisting links, and
biased activity of agents and biased attachment of links. It was shown [1] that, when the degree of rewiring in the graph
is adjusted to $\beta \approx 3$ (i.e., for each newly added link in the graph there are on average three updated links among
preexisting nodes), the model reproduces fairly well the emergent power-law distributions of the out- and in-degree,
and the scaling exponent of the connected components. In the present work we use this model to study the details of
the growth process that precedes the emergence of link structure in the Web graph. In particular, we demonstrate that
a spatio-temporal fractal structure of linking activity occurs on the growth time scale by successive addition of nodes
and the average increase of the number of links by $M = \beta + 1$. The fractal properties of the structure are measured
by the scaling behavior of the distribution of time intervals of the first-return activity to a given node. We show that
this activity pattern results in an algebraic increase of the average local connectivity $\langle q_\kappa(s,t) \rangle$ at node $s$ with time
($\kappa$ refers to "out" and "in" links), implying scaling behavior in the underlying local probability distribution. In view
of the scaling theory, the local connectivity is then related to the emergent degree distribution in the limit of large
evolution times $t \to \infty$ [?]. We demonstrate these steps by directly simulating the appropriate quantities, both for
out- and in-links. In addition, we show that dynamic processes, such as random walks on the Web graph that
use the information on the local connectivity, may also result in power-law distributions. Here we compute (for
the same graph parameters) the distributions of distances on the node hierarchy made by an ensemble of random walkers
which utilize parts of the locally available information on the Web structure.

II. LINKING RULES AND FRACTAL GROWTH PATTERNS



The growth model is defined by the dynamic rules [1], which can be summarized as follows. At each time step
$t > M_0 > M$ add a node $i = t$ and create $M$ links. A link is first attempted from the newly added node, with probability
$\alpha$, to a target node $k$ that is selected with the probability $p_{in}(k,t)$ specified below. Otherwise, a link is created between a
pair of preexisting nodes (updated link) as follows: a link from node $n < i$ to target node $k < i$ at time $t = i$ occurs
with the probability

$$ C(n,k,t) = (1 - \alpha)\, p_{out}(n,t) \times p_{in}(k,t) , \qquad (1) $$



where the probability $p_{out}(n,t)$ to select an origin of the link and the probability $p_{in}(k,t)$ to select a target node both depend on the
current connectivity of these nodes, $q_{out}(n,t)$ and $q_{in}(k,t)$, respectively:

$$ p_{out}(n,t) = \frac{\alpha + q_{out}(n,t)/M}{(1+\alpha)\,t} , \qquad p_{in}(k,t) = \frac{\alpha + q_{in}(k,t)/M}{(1+\alpha)\,t} . \qquad (2) $$



At the moment of addition $q_{out}(i,i) = q_{in}(i,i) = 0$, and both increase in time. Therefore the ratio of the number of updated
to added links, $\beta = (1 - \alpha)/\alpha$, which is independent of the actual number of links $M$, is the control parameter of
the model. For simplicity we keep $M$ fixed, assuming that the number of links fluctuates in time around the average
value $M$. The motivation for the above linking rules is discussed in detail in Ref. [?]. For $\beta \approx 3$ the results of numerical
simulations within this model (see Ref. [1]) agree satisfactorily with the empirical data on the real Web [?]. We would
like to stress that the property of rewiring while the graph grows, which is enabled by $C(n,k,t) > 0$ in Eq. (1), yields
qualitatively new scaling features compared with graphs with frozen links ($\beta = 0$). For instance, one of the
important consequences of rewiring is the appearance of the scale-free structure of the out-degree distribution. The
entire class of graphs generated with the above rules for varying $\beta$ in the range $0 < \beta < \infty$ was studied recently.
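
The linking rules of Eqs. (1)-(2) can be sketched in a few lines of Python. This is a minimal illustration, not the code used for the simulations reported here: the function name `grow_web_graph`, the seed of $M + 1$ unlinked nodes, the exclusion of self-links, and the choice to evaluate all probabilities at the start of each time step are assumptions made for the example. Nodes are drawn with weight $M\alpha + q$, which is Eq. (2) up to normalization by the current node and link counts.

```python
import random


def grow_web_graph(n_nodes, beta=3.0, m_links=4, seed=0):
    """Grow a directed graph with rewiring; returns (q_out, q_in, events).

    events holds one (t, origin, target) tuple per created link.
    """
    rng = random.Random(seed)
    alpha = 1.0 / (1.0 + beta)          # inverts beta = (1 - alpha) / alpha
    m0 = m_links + 1                    # unlinked seed nodes (assumption)
    q_out = [0] * m0
    q_in = [0] * m0
    origins, targets = [], []           # one entry per existing link
    events = []

    def pick(ends, n_old):
        # Draw an old node with weight M*alpha + q: mix a uniform draw
        # over nodes with a uniform draw over existing link endpoints.
        uniform_weight = m_links * alpha * n_old
        if rng.random() * (uniform_weight + len(ends)) < uniform_weight:
            return rng.randrange(n_old)
        return rng.choice(ends)

    for i in range(m0, n_nodes):        # node i is added at time t = i
        q_out.append(0)
        q_in.append(0)
        new_links = []
        for _ in range(m_links):
            if rng.random() < alpha:    # link from the newly added node
                n = i
            else:                       # updated link between old nodes
                n = pick(origins, i)
            k = pick(targets, i)
            while k == n:               # no self-links (assumption)
                k = pick(targets, i)
            new_links.append((n, k))
        for n, k in new_links:          # commit the links after the step
            q_out[n] += 1
            q_in[k] += 1
            origins.append(n)
            targets.append(k)
            events.append((i, n, k))
    return q_out, q_in, events
```

With `beta=3` the function uses $\alpha = 1/(1+\beta) = 0.25$. The returned `events` list records, for every link, the time step together with the node that was active as origin and the node that was active as target.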

Here we concentrate on the growth phase of the Web graph. We fix $\beta = 3$ and choose $M = 4$. In principle, the share
between added and rewired links is statistical, thus the universal degree distributions are independent of $M$ in the
scaling region [1]. However, some local properties can depend on the actual growth rate of the number of links.
Showing these dependences is another goal of this work.

FIG. 1. Temporal patterns of activity: in-linking (crosses) and out-linking (circles) for $M = 4$ and network size $N = t = 10^4$.

In Fig. 1 we show the activity pattern of each node during the growth time $t = 1, \ldots, N$, with $N = 10^4$ node-steps.
A node $n$ can be linked only at times after its appearance, $t > n$. The vertical coordinate of a point in the plane
$(k,t)$ represents the moment of time $t$ at which the node $k$ received an in-link (upper-left part), or at which a node $n$ created an
out-link (lower-right part). These patterns can be characterized quantitatively in two ways.

First, we notice that the time intervals between two consecutive linking activities at a given node are irregular,
resembling a fractal set. For this reason we measure the distribution of successive time intervals $\Delta t$ for in-linking
and out-linking separately. The results are given in Fig. 2 (left), where the algebraic decay of the distributions,
$P_\kappa(\Delta t) \sim (\Delta t)^{-\mu_\kappa}$, confirms the fractal character of these sets. The slopes of the two curves are $\mu_{out} = 0.82 \pm 0.01$
and $\mu_{in} = 0.87 \pm 0.01$. Second, fixing a node $s$ we watch how the number of links accumulates at that node with time.
Averaging over a large ensemble of networks, we find that for $t \gg s$

$$ \langle q_\kappa(s,t) \rangle \sim t^{\gamma_\kappa} , \qquad (3) $$

where $\kappa$ = "out" or "in", as shown in Fig. 2 (right), with the scaling exponents $\gamma_{out} = 0.66 \pm 0.03$ and
$\gamma_{in} = 0.87 \pm 0.03$.
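
The first measurement described above, return-time intervals and their log-binned distribution, can be sketched as follows. The function names, the input format (a list of `(t, node)` activity records), and the least-squares estimate of the slope are illustrative assumptions; the fitting procedure behind the quoted exponents is not specified in the text.

```python
import math
from collections import defaultdict


def return_intervals(activity):
    """Gaps between successive activity times of the same node.

    activity: iterable of (t, node) pairs in any order. Repeated
    activity within one time step (zero gap) is skipped.
    """
    times = defaultdict(list)
    for t, node in activity:
        times[node].append(t)
    gaps = []
    for ts in times.values():
        ts.sort()
        gaps.extend(b - a for a, b in zip(ts, ts[1:]) if b > a)
    return gaps


def log_binned_density(values, ratio=1.2):
    """Density of positive values in bins [ratio**j, ratio**(j+1)).

    Returns (centers, density) for the non-empty bins, with geometric
    bin centers and counts divided by sample size and bin width.
    """
    counts = defaultdict(int)
    for v in values:
        counts[math.floor(math.log(v) / math.log(ratio))] += 1
    centers, density = [], []
    for j in sorted(counts):
        lo, hi = ratio ** j, ratio ** (j + 1)
        centers.append(math.sqrt(lo * hi))
        density.append(counts[j] / (len(values) * (hi - lo)))
    return centers, density


def loglog_slope(x, y):
    """Least-squares slope of log(y) against log(x)."""
    lx = [math.log(v) for v in x]
    ly = [math.log(v) for v in y]
    mx, my = sum(lx) / len(lx), sum(ly) / len(ly)
    sxy = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    sxx = sum((a - mx) ** 2 for a in lx)
    return sxy / sxx
```

For integer-valued intervals the narrowest bins contain at most one integer, so a fit would normally be restricted to larger $\Delta t$. The default bin ratio 1.2 is the value quoted in the caption of Fig. 2.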

III. LOCAL CONNECTIVITY AND EMERGENT DEGREE DISTRIBUTIONS



The average connectivity at a node increases as a power of evolution time for times $t \gg s$, which is compatible with
scaling behavior of the local probability distribution $p_\kappa(q,s,t)$ that the node $s$ has collected $q$ links in the time up to step
$t$. It was shown analytically [?] that preferential linking with the probability $p_{in}(k,t)$ given in Eq. (2) leads to
power-law behavior of $p_{in}(q,s,t)$ in both the $q$ and $t/s$ arguments. Our results in Fig. 2 suggest that, due to rewiring
with the probability $C(n,k,t)$ in Eq. (1), the distribution $p_{out}(q,s,t)$ also exhibits scaling behavior, but with a different
exponent ($\gamma_{out} \neq \gamma_{in}$). The emergent degree distributions $P_\kappa(q)$ are defined as

$$ P_\kappa(q) = \lim_{t \to \infty} \sum_{s<t} p_\kappa(q,s,t) \sim q^{-\tau_\kappa} , \qquad (4) $$

which we extend to both out- and in-links. In addition, the exact scaling relation that applies to in-link distributions
can be easily extended to out-links [?], i.e.,

$$ \tau_\kappa = 1/\gamma_\kappa + 1 . \qquad (5) $$
Here we assume that the general scaling form applies both for in- and out-links,

$$ p_\kappa(q,s,t) \sim (s/t)^{\rho} f\left(q\,(s/t)^{\Delta}\right) , \qquad (6) $$

with a conserved number of links of both kinds, i.e., $\sum_q p_\kappa(q,s,t) = 1$, where $\kappa$ = "in", "out". Then together with Eqs.
(3)-(4) we find $\rho = \Delta = \gamma_\kappa$ and $\tau_\kappa = (1 + \rho)/\Delta$, leading to Eq. (5). The measured distributions of emergent node
ranks $P_{in}(q)$ and $P_{out}(q)$ after $N = 10^5$ added nodes are shown in Fig. 3. The slopes $\tau_{out} - 1 = 1.70 \pm 0.03$ and
$\tau_{in} - 1 = 1.26 \pm 0.02$ obey the scaling relation (5) with the respective values for $\gamma_{out}$ and $\gamma_{in}$ taken from the average
connectivity in Fig. 2.
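
The step from the scaling form (6) to the relation (5) can be made explicit. The sketch below treats $q$ as continuous, writes $x = s/t$ and $u = q\,x^{\Delta}$, assumes that the scaling function $f$ decays fast enough for the integrals to converge, and assumes that the sum over $s$ in Eq. (4) is taken per node (divided by $t$) so that it becomes an integral over $x$. These are assumptions of the sketch, not statements from the text.

```latex
% Normalization, \sum_q p_\kappa(q,s,t) = 1:
\int_0^\infty dq\, x^{\rho} f(q x^{\Delta})
  = x^{\rho-\Delta} \int_0^\infty du\, f(u) = 1
  \;\Rightarrow\; \rho = \Delta .

% Mean connectivity, compared with Eq. (3):
\langle q_\kappa(s,t) \rangle
  = \int_0^\infty dq\, q\, x^{\rho} f(q x^{\Delta})
  = x^{\rho-2\Delta} \int_0^\infty du\, u f(u)
  \sim (t/s)^{\Delta}
  \;\Rightarrow\; \Delta = \gamma_\kappa .

% Emergent distribution, Eq. (4), for large q:
P_\kappa(q) \sim \int_0^1 dx\, x^{\rho} f(q x^{\Delta})
  = \frac{q^{-(1+\rho)/\Delta}}{\Delta}
    \int_0^{q} du\, u^{(1+\rho)/\Delta - 1} f(u)
  \;\Rightarrow\; \tau_\kappa = \frac{1+\rho}{\Delta}
  = \frac{1}{\gamma_\kappa} + 1 .
```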

FIG. 2. Left panel: Distributions of return times for out- and in-linking. Right panel: Average connectivity at node $s = 10$
vs. evolution time $t$. $N = 10^4$, $M = 4$, data log-binned, bin ratio 1.2.




FIG. 3. Cumulative distributions of node degrees for out- and in-links after $t = N = 10^5$ evolution steps. $M = 1$, log-binning
ratio 1.1. Inset: Corresponding scaling exponents vs. $1/N$.

IV. WALKER ON THE WEB: LOCAL STRUCTURES



We have demonstrated that the evolution of the local organization of both out- and in-links at individual nodes is responsible
for the global scaling of the emergent node distributions for large times and numbers of nodes. Here we would like to
show that dynamic processes on the grown network (i.e., on a different time scale) which use the information on these local
properties also obey certain scaling laws. Such processes on the Web are different kinds of random walks related, for
example, to search algorithms. We define two types of random walks [?]: the adaptive random walk (ARW), which
selects the target node $k_\ell$ with a weight proportional to the in-linking probability of the visited node, and the
naive random walk (NRW), which selects one of the out-links with equal probability. The corresponding weights are

$$ w_{ARW}(n,k_\ell) = p_{in}(k_\ell) \Big/ \sum_{\ell=1}^{q_{out}(n)} p_{in}(k_\ell) , \qquad w_{NRW}(n,k_\ell) = 1/q_{out}(n) . \qquad (7) $$
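
One step of each walker can be illustrated with the sketch below. It assumes that the adaptive weights are normalized over the out-links of the current node and that a walker stops at a node without out-links; the function names and the adjacency-list representation `out_links` are hypothetical. Since the factor $(1+\alpha)t$ in Eq. (2) is common to all targets, the weight of target $k$ reduces to $M\alpha + q_{in}(k)$ before normalization.

```python
import random


def step_weights(node, out_links, q_in, alpha, m_links, adaptive):
    """Targets reachable from `node` and their selection weights."""
    targets = out_links[node]
    if not targets:
        return [], []
    if adaptive:
        # ARW: weight proportional to p_in of the target, normalized
        # over the out-links of the current node (assumption).
        raw = [m_links * alpha + q_in[k] for k in targets]
        total = sum(raw)
        return targets, [r / total for r in raw]
    # NRW: every out-link is equally likely.
    return targets, [1.0 / len(targets)] * len(targets)


def walk(start, n_steps, out_links, q_in, alpha=0.25, m_links=4,
         adaptive=True, rng=None):
    """Return the list of nodes visited by one walker."""
    rng = rng or random.Random(0)
    path = [start]
    node = start
    for _ in range(n_steps):
        targets, weights = step_weights(
            node, out_links, q_in, alpha, m_links, adaptive)
        if not targets:                 # dangling node: stop (assumption)
            break
        node = rng.choices(targets, weights=weights)[0]
        path.append(node)
    return path
```

The in-degree differences between successive entries of `path` could then be collected over many walkers; how the distances $\Delta q_\kappa$ are defined in detail is not reproduced by this sketch.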



In Fig. 4 we show the distributions of distances in hierarchy levels $\Delta q_\kappa$ inside the clusters of connected nodes that
are visited in cumulative time by an ensemble of walkers. As Fig. 4 shows, these local clusters are organized
scale-free structures with $W(\Delta q_\kappa) \sim (\Delta q_\kappa)^{-\delta_\kappa}$. Notably, the scaling exponents satisfy $\delta_{out} \approx \delta_{in}$, expressing the correlations
due to normalization of weights in Eq. (7). For the case of ARW the exponents are close to $\tau_{in}$ of the global structure
of in-links, whereas for the clusters scanned by NRW they are reduced by approximately unity, i.e., $2.07 \pm 0.04$ and
$1.10 \pm 0.03$ for ARW and NRW, respectively.

FIG. 4. Time-integrated distributions of distances of out- and in-degree $\Delta q$, made by adaptive (ARW) and naive (NRW)
walkers on the Web graph. $N = 10^4$, $M = 4$, log-binning ratio 1.1.



V. CONCLUSIONS 



The probabilistic character of the linking rules for $\alpha < 1$, with self-consistently varying linking probabilities, leads to
power-law behavior of local and global (emergent) link structures, both for out- and in-links. These numerical results
(see also Refs. [?]) are in agreement with the analytical results obtained by the rate equation approach in the
same model [11]. We have demonstrated here that the basis of these scaling laws lies in the occurrence of dynamic
fractals and hence the algebraic growth of the local connectivity in time, from which the hierarchical global
structure then emerges at large times.

In addition, such local connectivity affects random-walk processes on grown networks. The connected clusters
(subgraphs) scanned by the random-walk ensembles also exhibit scaling behavior of distances in node degrees. The
scaling properties of these subgraphs on the Web strictly depend on the applied random-walk strategy, i.e., on the degree
of information about local connectivity that the walkers use. Moreover, the scaling exponents of the distributions of
distances in these subgraphs on the Web decrease with increasing rate $M$, in contrast to the global structure of the
graph, which is universal for large network size $N$.